Hybrid MPI/OpenMP parallelization in FETI-DP methods

نویسندگان

  • Axel Klawonn
  • Martin Lanser
  • Oliver Rheinbach
  • Holger Stengel
  • Gerhard Wellein
چکیده

We present an approach to hybrid MPI/OpenMP parallelization in FETIDP methods using OpenMP with PETSc in the finite element assembly and using the shared memory parallel direct solver Pardiso in the FETI-DP solution phase. Our approach thus uses OpenMP parallelization on subdomains and MPI in between subdomains. We investigate the efficiency of this approach for a benchmark problem from two dimensional nonlinear hyperelasticity. We observe good scalability for up to 4 threads for each MPI rank on a state-of-the-art Ivy Bridge architecture and incremental improvements for up to 10 OpenMP threads for each MPI rank.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Advanced Hybrid MPI/OpenMP Parallelization Paradigms for Nested Loop Algorithms onto Clusters of SMPs

The parallelization process of nested-loop algorithms onto popular multi-level parallel architectures, such as clusters of SMPs, is not a trivial issue, since the existence of data dependencies in the algorithm impose severe restrictions on the task decomposition to be applied. In this paper we propose three techniques for the parallelization of such algorithms, namely pure MPI parallelization,...

متن کامل

Parallelization of the Treecode Algorithm for N-Body Simulation Using MPI, Hybrid, and GridRPC Programming Paradigms

This dissertation describes the parallelization of the treecode algorithm for N-Body problem and performance comparison among three different parallel programming paradigms, MPI, hybrid MPI-OpenMP, and GridRPC. In N-Body simulation, the specific routine for calculating the forces on the bodies which accounts for upwards of 90% of the cycles in typical computations is eminently suitable for obta...

متن کامل

Parallelization of a finite difference scheme for solving systems of 2D Sine-Gordon equations

Systems of perturbed 2D Sine-Gordon equations coupled via a cyclic tridiagonal matrix are solved numerically by a second-order centered finite difference scheme. The systems are considered on rectangular domains. First an OpenMP parallel program is realized and very good performance scalability inside one computational node is achieved. The tests on one computational node of the CPU platform in...

متن کامل

Efficiency of Large-scale Cfd Simulations on Modern Supercomputers Using Thousands of Cpus and Hybrid Mpi+openmp Parallelization

This work represents an experience in using the hybrid parallel model to perform large-scale DNS. Advantages of the hybrid approach compared to the MPI-only approach are presented and discussed. The use of OpenMP in addition to MPI is demonstrated for modelling of compressible and incompressible flows using both structured and unstructured meshes. A parallel Poisson solver for incompressible fl...

متن کامل

A Hybrid Parallelization of AIM for Multi-Core Clusters: Implementation Details and Benchmark Results on Ranger

This paper presents implementation details and empirical results for a hybrid message passing and shared memory paralleliziation of the adaptive integral method (AIM). AIM is implemented on a (near) petaflop supercomputing cluster of quad-core processors and its accuracy, complexity, and scalability are investigated by solving benchmark scattering problems. The timing and speedup results on up ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015